Imitation Learning with Concurrent Actions in 3D Games
نویسندگان
چکیده
In this work we describe a novel deep reinforcement learning neural network architecture that allows multiple actions to be selected at every time-step. Multi-action policies allows complex behaviours to be learnt that are otherwise hard to achieve when using single action selection techniques. This work describes an algorithm that uses both imitation learning (IL) and temporal difference (TD) reinforcement learning (RL) to provide a 4x improvement in training time and 2.5x improvement in performance over single action selection TD RL. We demonstrate the capabilities of this network using a complex in-house 3D game. Mimicking the behavior of the expert teacher significantly improves world state exploration and allows the agents vision system to be trained more rapidly than TD RL alone. This initial training technique kick-starts TD learning and the agent quickly learns to surpass the capabilities of the expert.
منابع مشابه
Efficient Reductions for Imitation Learning
Imitation Learning, while applied successfully on many large real-world problems, is typically addressed as a standard supervised learning problem, where it is assumed the training and testing data are i.i.d.. This is not true in imitation learning as the learned policy influences the future test inputs (states) upon which it will be tested. We show that this leads to compounding errors and a r...
متن کاملIs Bayesian Imitation Learning the Route to Believable Gamebots?
As it strives to imitate observably successful actions, imitation learning allows for a quick acquisition of proven behaviors. Recent work from psychology and robotics suggests that Bayesian probability theory provides a mathematical framework for imitation learning. In this paper, we investigate the use of Bayesian imitation learning in realizing more life-like computer game characters. Follow...
متن کاملBorn to Learn: What Infants Learn from Watching Us
Imitation is a powerful form of learning commonly used by children, adults and infants. A child's enthusiasm for imitative behavior prompts parental attention and interaction, and provides a mechanism for transmitting appropriate cultural and social behavior. Although simple imitative behavior is evident in the postnatal period, by around 14 months infants remember and repeat actions they obser...
متن کاملLearning Strategies for Coordination of Multi Robot Systems: a Robot Soccer Application
This paper presents a hybrid method for learning a dynamic strategy for a robot soccer team. In this method, an imitation learning scheme based on observed robot soccer games is used as a seed for an experience-guided learning scheme based on reinforcement learning. A lack in the application of classic reinforcement learning to the robot soccer problem is the high number of states to be analyze...
متن کاملBelievability Testing and Bayesian Imitation in Interactive Computer Games
In imitation learning, agents are trained to carry out certain actions by examining a demonstration of the task at hand. Though common in robotics, little work has been done in translating these concepts to computer games. Given that present-day games generally use antiquated AI techniques which can often lead to stilted, mechanical and conspicuously artificial behaviour, it seems likely that a...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2018